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Abstract 

In the Kingman coalescent tree the length of order r is defined as the sum of the lengths of 
all branches that support r leaves. For r = 1 these branches are external, while for r > 2 they 
are internal and carry a subtree with r leaves. In this paper we prove that for any s € N the 
vector of rescaled lengths of orders 1 < r < s converges to the multivariate standard normal 
distribution as the number of leaves of the Kingman coalescent tends to infinity. To this end we 
use a coupling argument which shows that for any r > 2 the (internal) length of order r behaves 
asymptotically in the same way as the length of order 1 (that is the external length) . 
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1 Introduction and main result 



The Kingman coalescent was introduced in [Ki82j as a model for describing the genealogical 
relationships between the individuals for a wide class of population models (see |Wa08| for 
details). The state space of the Kingman n-coalescent, n £ N, is the set V n of partitions of the 
set {1,2, ... ,n}. The process starts in the partition into singletons ir n = {{1}, . . . , {n}} and has 
the following dynamics: given that the process is in the state nt, it jumps after a random time 
Xk to a state tt^-i which is obtained by merging two randomly chosen elements from tx^. The 
random inter-coalescence times Xk are independent, exponentially distributed random variables 
with parameters ({;)• The process can be viewed graphically as a rooted tree that starts from n 
leaves labelled from 1 to n and whose any two branches coalesce independently at rate 1. Each 
branch of this tree is situated above a subtree. If this subtree has r leaves, we say that the branch 
is of order r. The branches of order r > 2 are the internal branches, while those of order 1 are the 
external ones (they support subtrees consisting of just one node). 
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Figure 1: The dashed (red) branch is an internal branch of order 4, it supports the leaves 1, 2, 
3 and 4. It is formed at level <r(l,...,4) = 5 and ends at level p(l,...,4) = 3. Its length is 
<Si,2,3,4 = X^ + X§. The dotted (green) branches are the branches of order three. The numbers 
of branches of orders 1 to 10 at level 5 are W 5 (l) = 3, W 5 (2) = 0,W 5 (3) = 1,W 5 (4) = 1 and 
W 5 (i) = for i > 5. 

Let us look at the tree from the leaves towards the root. Then the branch of order r supporting 
the leaves formed at the level a{i\, ■ ■ ■ , i r ) and ends at level p(i%, ■ ■ ■ , i r ), where 

a(i\, . . . , i r ) = max{l < k < n : {ii, . . . , i r } £ irk} and 

p(ii, . . . ,v) = max{l < k < a{n, . . . ,i r ) : . . . ,i r } £ Ti k }. 

For a subset . . . ,i r } of leaves, which is not supported by some branch (which means that 
{ii, . . . , i r } ^ 7Tfc for all k) we set . . . ,i r ) = p(ii, . . . ,i r ) = n. 
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Let j r denote the length of the branch of order r that supports the leaves i\, . . . ,i r and 
write C n ' r for the total length of order r. Then, 

a(ii,...,i r ) 

Si u ...,i r = y~] Xi 

l=p(il,...,i r )+l 

and 

£n,r \ q 

l<i\<- ■ ■ <i r <n 

The length of a randomly chosen external branch in the coalescent tree has been studied by 
Freund and Mohle [FM09] for the Bolthausen-Sznitman coalescent and by Gnedin et al. [GIM08J 
for the A-coalescent. Asymptotic results concerning the total external length of Beta-coalescents 
were given by Mohle [MoelOj for the case < a < 1, by Kersting et al. [KPSJ13] for the case 
a = 1 and by Kersting et al. [K SW12] for the case 1 < a < 2. For the case 1 < a < 2 a weak law 
of large numbers result concerning C n ' r can be easily deduced from Theorem 9 of Berestycki et al. 
[BBS07] and also from Dhersin and Yuan [DY12J. 

Fu and Li [FL93] computed the expectation and variance of the total external branch length 
of the Kingman n-coalescent and Caliebe et al. [CNKR07] derived the asymptotic distribution 
of a randomly chosen external branch. In |JK11| Janson and Kersting obtained the asymptotic 
normality of the total external branch length. Our main result states that the same kind of 
asymptotics holds for the lengths of order r > 1. Moreover, these lengths are asymptotically 
independent. 

Theorem. For any s£N, as n —> oo 



Jj^— i^' 1 - mi, ... , £ n ' s - ^) ^ N (0, / s ) , 

V 41ogn v ' 

where I s denotes the s x s-identity matrix and fi r = E(£ n,r ) = - for every r > 1. 

In a forthcoming paper our theorem will be a main building block for proving a functional limit 
theorem for the total external length of the evolving Kingman-coalescent. 




Figure 2: External length versus internal length of order 2. The plot is based on 1000 coalescent 
realisations with n = 100. 
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The scatterplot for the lengths of orders 1 and 2 in Figure [2] confirms the theorem. The bulk of 
the points are located around the mean (/ii, ^2) = (2, 1)- Also, in this region hardly any correlation 
between the two lengths is visible. The outliers are due to exceptionally long branches whose 
occurrence has been explained in detail in |JK11| for the external case. The simulation shows that 
this phenomenon appears similarly in the case of internal lengths, as one would expect. 

As to the proof of the theorem, for the case s = 1 a hidden symmetry within the Kingman 
coalescent is used in [JKllj. Here we substantially build on this result, however the proof is rather 
different. It consists of a coupling device for Markov chains, which connects the total length of 
order r to the total external length: For 1 < k < n let Wfc(r) denote the number of order r at level 
k, the number of branches of order r among the k branches present in the coalescent tree after 
the (n — k)-ih coalescing event. (Note that here and elsewhere we are suppressing the n in the 
notation.) That is, 

Wk(r) := >V} C {1, . . . ,n} : i x < ■•• < i r ,a{i 1: --- ,i r )>k> p(h,--- 

In particular W n (r) = 0,W n _i(r) = 0, . . . , W n _ r +2(r) = and W\(r) = for r < n. For an 
example see Figure CD 

It is important to notice that for any s 6 N the random vectors (Wfc(l), Wk(2), . . . Wk{s)) form 
a Markov chain if k runs from n to 1 (a property which facilitated our simulations). The transition 
probabilities of the Markov chain are given explicitly in Section 3. The idea of our proof is to 
couple (Wk(r)) n >k>i for 1 < r < s jointly with s independent copies of the Markov chain of 
external numbers (Wfc(l))n>fc>i- Since in addition the length of order r is essentially specified by 
the chain (VFfc( r ))n>fc>i 5 it consequently gets the asymptotical behaviour of the external length. 




a) b) 

Figure 3: a) Simulations of the external numbers Wfc(l) (in orange) and internal numbers Wk(2) 
of order 2 (in blue) for a coalescent with n = 100 for 1 < k < n. The black dashed curves represent 
the expectations as given in Lemma [TJ Figure b) gives the representations in double logarithmic 
scale for a coalescent with n = 10 4 . 



The simulations in Figure [3] give an impression of the behaviour of the lengths of different 
orders. In the range between the levels n and n l ~ e for small e > (closer to the leaves) they 
differ substantially, as seen in Figure [3h.) . This deviation is only due to expectations and does not 
appear at the level of fluctuations. Indeed it is known from |JK11| that for the external length the 
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fluctuations are induced just by the PPfc(l) with n 1_e > k > sfn. As suggested by Figure [3b) in 
this region the evolution of the chains is similar for orders r > 2. The difference in expectation is 
negligible in our construction, as we couple the jumps of the chains and afterwards consider the 
lengths of different orders centred at expectation. 

The interest in the quantities C n ' r arose from models where the population is subject to mutation, 
the mutations being modelled as points of a Poisson process with constant rate | on the branches of 
the coalescent tree. In the infinitely many sites mutation model, in which every new mutation occurs 
at a new locus on the DNA, mutations that are located on the external branches of the coalescent 
tree affect only single individuals, whereas mutations located on an internal branch of order r > 2 
affect all r individuals sitting at the leaves supported by that particular branch. In a population 
of size n let M r (n) denote the number of mutations carried by exactly r individuals. The vector 
(Mi(n), . . . , M n _i(n)), called the site frequency spectrum, and the total number S n := J2r=i M r (n) 
of mutations that affect the population, called the number of segregating sites, are quantities of 
statistical importance. Berestycki et al. [BBS07J obtained a weak law of large numbers for M r (n), 
r > 1, in the case of Beta-coalescents with 1 < a < 2. 

For the Kingman coalescent it is know that the number of segregating sites S n , when rescaled 
by logn, converges almost surely as n — > oo to 6 (see for example Bed!) Theorem 2.11). The 
expectation of M r (n) (which is equal to ~), as well as the variances and the covariances of the 
numbers of mutations M r (n), were computed by Fu [Fu95] and Durrett [Du08j. We obtain the 
following result as a direct consequence of our Theorem. 

Corollary. For any s£N, as n — > oo 

(Mi(n), . . . , M s {n)) A (Mi, ... , M s ), 

where Mi,...,M s are independent Poisson- distributed random variables with parameters 
9 hi, . . . , 8[i s . 



For the proof of the corollary note from the Poissonian structure of the mutation process that 
the characteristic function of (Mi(n), . . . ,M s (n)) is 



¥>(Mi(n),...,M s (n))(Al, • • • , A s ) — E 

= E 



E 



J(XiMi(n)+-+X s M s (n)) i j- 



where T denotes the cr-algebra containing the whole information about the coalescent tree. From 
our Theorem it follows that C n,r — — > fj, r as n — > oo (which can also be deduced from the results of 
Fu |Fu95j ) and therefore 



y(Afi(n),..,JU s (n))(Ai,... ,A S J — ► e MU )... e ^\ > } 

as n — > oo. 

Notation. Throughout c denotes a finite constant whose value is not important and may change 
from line to line. 
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2 Moment computations 



Lemma 1. For the expectation and variance of Wk(r) the following is true. For n> r 

E(W fc (r)) = {n ~ ( k) ' " ( " ~ k - \ + 2) • k(k - 1) and V(W k (r))<c-, 

[n — 1) ... [n — r) n 

where c < oo is a constant depending on r. In particular 

E(W fc (r))= ( — ) ._+0(-)=_+0(_ + 



n / n \n/ n \n" n 
uniformly in k < n. Also, for a > 2 

Proof. In order to compute the moments of M^(r), let us again label the leaves of the coalescent 
tree from 1 to n and note that Wfc(r) can be written as 

W k (r) = 1 {{h,..M^ k }, 

l<ii<---<i r <n 

where 7r^ is the state of the coalescent process at time k. Then for n > r, us- 
ing the fact that the event {{l,...,r} G 71"^} is the disjoint union (over n > Zi > 
/ 2 > ... > i rl > fc) of the events {the branch supporting leaves 1, r is formed by r — 
1 coalescing events happening at levels h,h, ■ ■ ■ , l r -i}, we have from exchangeability that 



E(W fc (r))=E[ Yl iUn,..,^}] = (")p({1,- • • ,r} G 7r fc ) 
\l<h<-<i r <n J ^ ' 



(1) 



r 



/n— r\ (n—l—r\ (h+2—r\ (r\ (h — {r—\)\ 

E \ 2 ) { 2 I \ 2 ) \2) \ 2 ) 

(n\ ' (n-l\ ' ' ' (h+2\ ' (h+l\ ' (h\ 

n>h>l 2 >->lr-l>k ^2/ \ 2 ) \ 2 ) \ 2 ) \2) 

C' +2 -<r +i) ) m ^-%- 3) ) M (2) c-n ® 



Most binomials in the nominator and the denominator cancel. The summands turn out to be equal 
such that 

■ ■ ■ (2) ( k '~ 



V 7 Jl>h>i 2 >->ir-l>fc ' " V 2 ^ V ' 

/ n\fn- k\ Q ■ ■ ■ Q / ^ x 



(n-fc)-(n-fc-r + 2) 

(„-l)...(n-r) 1 j - 
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The computation of the second moment of Wk(r) follows in a similar way. Note that the event 
{{h, ■ ■ ■ , i r }, {ji, ■ ■ ■ ,j r } G TTfc} is nonempty only if the sets . . . , i r } and {ji, . . . ,j r } are identical 
or disjoint. Thus, for n > 2r 

eKM) = (")e (if,, r)e , (} ) + ( r ^ n "_ 2i ,)e(i {{1 r)etk , ■ i {{r+1 2r)ew) ) 

= (;)P({L ■■■•'} e n) + ( r>r _ „"_ 2r )p(d, • • • ,r>, {r + 1, . . . , 2r> e n t ) 

/ _ \ C»-2r\ (Ji'+2-2r\ /r\ /ii'-(2r-l)\ 

Vr,r,n-2ry^ (») ( f i+ 2 ) f^ 1 ) 

fcT) i fi-r 2 ) (V) 
'''fS 2+2 )'(S 2+ V (V) '"ft 1 ) 

where the sum is taken over all n > Zi > Z2 > • ■ ■ > K-i — & an( i all « > Z^ > Z 2 > ■ ■ • > l' r _i > fc 
such that {Zi, . . . , l r -i\ D {Z^, . . . , Z{.-i} = 0. The sequences (Zj)i<j< r _i and (Zj)i<j< r -i denote the 
coalescence times of the branches supporting leaves from the sets {1, . . . , r} and {r + 1, . . . , 2r} 
respectively. The sequence (Z")i<j<2r-2 is the reordering of Zi, . . . , l r -i, l[, ■ ■ ■ , l' r -\ in decreasing 
order. Thus 



E(Wi(r)) = E(W fc (r)) + 



r, r, n - 2r J \r - 1, r - 1, n - fc - 2r + 2/ (™) . . . ( n ~ 2r+1 ) V 2 



n-k \ ((D-©) 2 



= E^r)) + (n - fc) - n (n - fc -V 3) ^ - l) 2 (fc - 2). 
(n — 1) • • • (n — 2r) 

The variance of Wfc is then for k < n — 1 

V(W fc (r)) = E (W k (r)) (l + (n - fc - r + 1) • • • (n - fc - 2r + 3) _ _ 

\ (n — r — 1) ■ ■ ■ (n — 2r) 

(n - fc) • • • (n - fc - r + 2) _ ^ 
(n - 1) • ■ ■ (n - r) V ; 

< E (W fc (r)) + fc(fc - l)(n - fc) ■ ■ ■ (n - fc - r + 2) 



(n — r — 1) • • • (n — 2r) (n — 1) ■ ■ ■ (n — r) 



fc 2 

< c — 

n 



for c < 00. 

For the other claims we use the same type of argument as above. We have that 

( n V({l,...,r},...,{(a-l)r + l,...,ar}G7r fc ) 

\r, . . . ,r,n — ar J \ / 

n — k \ ((2) ■ ■ ■ (2)) /fc\ /fc— a+1 



r, . . . , r, n — ar J \ r — 1, . . . , r — 1, n — fc — a(r — I) J Q) " " " (™ ( 2 T+1 ) V 2 
(n — fc) ■ ■ ■ (n — fc — a(r — 1) + 1) 



2 



(n — 1) ■ ■ ■ (n — ar) 



fc(fc - l) 2 • • • (fc - ar + l) 2 (fc - ar) 
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{n - k)<^) , 2a _ A , ./(n- fc) a ( r-1 ) _1 -\ , _ An - fc)^ 1 ) ' 



+ O ± '- k +0\ ± '- k 4a \+0\ - k 



n"' \ n 



In particular this gives the asymptotic expansion of E(W^.(r)). Also 



n 



{l,...,r},...,{(a-l)r + l,...,ar} evr fc =0 — . (3) 



k 2 ° 



Moreover, by expanding f Ei<< 1 <...« r <n ,v}£^} 



a 



E(W fc a (r))= P {l,...,r},...,{(a-l)r + l,...,ar}€7r fc (4) 

\ r, . . . , r, n — ar ! 



' a-l 



V({1, . . . , r}, . . . , {(a - l)r + 1, . . . , ar} E n k 

+°(E( r) ... i ;_^({ 1 ^••• 1 «My + i,,M e , 

The last claim now follows from ©, ©, flU and the fact that YTp=\ (?) = (Ctt^ 1 ' ^ 



□ 

Remark. It can be read from the computation in ([1]) that in the case r = 2, given the 
event {p(l,2) < the random variable <r(l,2) is uniformly distributed on the set of levels 
{k, . . . , n — 1}. A similar observation can be made for the case r > 2. 

Using the numbers W k {r) we have the following simplified expression for the length of order r: 

C n > r = £ W fc (r)-X fc . (5) 

2<fe<n 

We note from this representation that there are two sources of randomness in the length of order 
r, one coming from the numbers W k (r) and one coming from the exponential inter-coalescence 
times. It is easy to see that taking out the randomness introduced by the exponential times (i.e. 
replacing them by their expectations) leads to an error that is asymptotically Op (n -1 / 2 ) and 



therefore converges to after the rescaling by w j^;. Indeed, by using the independence between 
the X k s and the Wk(r)'s and Lemma Q] we have that for a constant c < oo 



<k<n 

J2 E(W*(r))-E((X k -E(X k )) 



2<k<n 2<k<n 



2<k<n 



v-^ / k A k 2 \ 1 1 

z — " \ n z n J f™j n 



2<k<n 
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and therefore 

C n ' r = L n ' r + P {n- 1 /' 2 ), 

where 

L^:= £ W k (r)-E(X k )= £ W fc (r) ■ — (6) 

2<k<n 2<k<n ^ ' 

As a consequence, in the proof of our Theorem we need only focus on the length L n ' r which we 
will, for convenience, still call the length of order r. 



3 The coupling 

Our proof follows a coupling argument which substantially relies on the observation that for every 
s G N the vector V k := (W k (l), W k (2), . . . W k (s)) follows for n > k > 1 the dynamics of an 
inhomogenous Markov chain with state space X n ^ s := {0, 1, . . . , n} s . We let time run in coalescent 
direction (from the leaves to the root of the tree) and for convenience we consider the evolution of 
the chain (V k ) n > k >i, running in the same direction, namely from level n to level 1. 

For every 1 < r < s we denote by AW„_i(r), . . . , AWi(r) the sizes of the jumps of the chain 

{W k (r)) n > k >i, 

AW k (r) := W fc (r) - W k+1 (r), n - 1 > k > 1 

and observe that AW k (r) G {—2, —1,0, 1} for all k. The jumps of size 1 correspond to the levels at 
which a new branch of order r is formed (by the coalescence of two other branches), whereas the 
jumps of sizes -1 and -2 happen at the levels at which one or respectively two branches of order r 
end (by coalescence of one of them with some other branch or by mutual coalescence). 

For 1 < k < n and v, v' G X n ^ s let 

P„V) :=Pfa_i = t/ \V k = v) 

denote the transition probabilities of iy k ) n >k>i- They are given for v = (wi, . . . , w s ), w±+- ■ -+w s < 
k by 

if v = v' , 

if v = v' — ej for some i, 

(7) 

if v = v — 2ej + e2i for some i, 

if v = v' — ej — ej + e^+j for some i / j, 
else, 

where = (5i,j)i<j< s (note that ej = (0, ...,0) for z > s). For all other u G X n , s we set for 
definiteness P^(v') = 8 v y. 



lk—w\—W2- 



P v k (v') 



(I) 

Wi(k — W\ — W2 

w 

(1) 

WiWj 

15" 
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In particular for s = 2 the transition probabilities in ([7]) are given for w\ + W2 < k by 

/A— 1U1— W2\ 



V27 



(5) 



" l(fc -, Wl " W2) P* ..fun - 2,1*. + 1) = SI 



^ 2) k - 1^2) = lv , k : — ^ ^ 1>W2) k - 2^2 + 1) 



- 1^2 - 1) = -j^ P( WuW2) (™i^2 ~ 2) 



and for s = 1 for it; < k by 

^H = ^, P>-1) = ^^ 5 P^-2) = i|. (8) 
I2J I2J I2J 



Let us now describe the coupling in detail. Let 1 < a n < n and s £ N be fixed. Starting at a n 
we couple the Markov chain (Vk) a >&>i, Vk = (Wfc(l)> • • • , Wk(s)), with another chain ( Vk 



a n >k>l 

Vk = (Wfc(l)> • • • > Wk{s)), defined on the same probability space as (Vk) an> k>i and having the same 
state space X ns . The components of \ Vk] evolve as independent copies of (Wfc(l)) >i.>i, 

the Markov chain of external numbers. Therefore its transition probabilities P^(-) for v £ X H:S and 
1 < k < a n are given by the product of the transition probabilities of its s components, given in 



(J8J). The process I I Vk, Vk)) is constructed as a Markov chain, where the jumps are coupled. 

Thus let Q k u and Q~ denote the conditional distributions of the jumps AVk and AVk of the two 
Markov chains, given the current states v and v respectively. (The notations AVk and AVk refer 
to component- wise differences). For the sequel it is important that the leading terms of Qy and Q~ 
agree. More precisely, from ([7]) and (jSJ), under the constrain that w\ + • • • + w s < k 



Q k v (z),Q k v (z) 



2wi 



2w, 



+ 



+ wi 



wj + 



O 



wj + 



k 2 



if z = (0,...,0), 



if z 



else, 



-e,- for some i. 



0) 



where z 6 {—2, —1,0, 1} S and ei = (#i,j)i<j< s . Here we use that WiWj < wf + w 2 and wi < k. 

As it is well-known (see for example [LP W09j ) . an optimal coupling of the two distributions is 
specified as follows. Let || • \ \tv denote the total variation distance between two distributions and 
define 

P = Pv,v ■= 1 



Qv Qv 



TV 
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Then, with probability p choose AV k = AV k = Z, where the random variable Z has distribution 
77, given by its weights 

Q k v (z)AQl(z) 
P 

z £ {—2, —1, 0, 1} S and with probability 1 — p choose AV k according to the probability distribution 
weights 

(Q k v (z)-Qv(z)] 
in(z) = — 



1 — p 

and independently choose AV k according to the probability distribution weights 



_ Qv(z) - Q k v (z] 

7iii(z) 



1-p 

ze{-2,-i,o,i} s . 

This coupling is optimal in the sense that the probability P ^AVfc ^ AV k | \% = v, V k = v\ is 
minimal among the corresponding probabilities for couplings of the two distributions Q\ and Q~, 
and therefore it is equal to Q k — Q~ . As starting distributions of the coupled chains (Vfc) an>fc>1 



and I V k ) we allow any distribution of I V an , V an I such that the marginals are the distribu- 

V / a n >k>l V / 

tions of V an and V an respectively. 

The next two lemmas give essential properties of the coupling. 

Lemma 2. There is a c < oo such that the above defined coupling satisfies for r < s 

p(AV, lM #A^( r ))< C (^ + ^ + i) 

and 

E (\W k (r) - W k (r)\) < c f-*L + ^ + 1 

for all 1 < k < a n . 



Proof. In the proof we write shortly W k instead of W k {r) and similarly W k , AW k and AW k instead 
of W k (r), AWk(r) and AW k (r) respectively. 



From ([9]) it follows that for both the chains (Vk) an >k>i an< ^ ( ^ ) jumps of sizes (0, . . . , 0) 

and — ei with 1 < i < r occur with probabilities of larger order than jumps of other sizes. It follows 
from the coupling that 

{AW k ^ AW k } C {AW k = -1, AV k ± AV k } 
U{AW k = -l,AV k ^AV k } 
U {AW k £ {1, -2}} U {AW k = -2} 

Thus, writing shortly P^^(-) for the conditional probability given the event {V k = v,V k = v}, we 
obtain 



11 



P^t 1 [AW k ± AW k j < (1 - P ) ln (AW k = -1) + (1 - P ) lHI (AW k = -1) 

+ P^J 1 (AW k G {1, -2}) + P^t 1 (&W k = - 

< (1 - p)7//(AVfc = -e r ) + (l-p)7m(AH = - 

+ P^t 1 (Aiy fc G {1, -2}) + p^t 1 (a^ = - 

< (1 - p)7Jj(AV fe = -e r ) + (1 -p) 7m (AVfc = -er) 

Using now the definitions of jjj and jjjj we get that 

P^ 1 [AW k + AW k ) < \Qt +1 (-e r ) - Qt l {-e r )\ + c • ^' =1 ^ + J ■ (10) 
Let us introduce the filtration F = (J r k )i< k < an with J 7 ^ C J an _i C • • • C F\ defined by 

Fk = ° ((yj)k<j<a n , {Vj)k<3<ani ) • 

Then (|10p in view of ([9]) may be written as 



AW fc + AW k | J- fc+1 J < - • \W k+1 - W k+1 \ + c ^ p '- (11) 

for a constant c < oo. Taking expectation in the equality above we obtain using Lemma Q] 

AW k £ AW k ) < ^ • E (\W k+1 - W k+l \) + c ^ + I V (12) 



We now proceed to finding a bound for E \ \W k — W k \j for 2 < k < a n . From the transition 
probabilities ([7]) we get that 

n^w k i T k+1 ] 

W k+l {r){k + 1 - W fc+ i(l) Wfc+i(r)) 



W fc+ i(r)W fc+ i(l) + --- + W fc+ i(r)W fc+ i(r-l) (^ + 2 l(r) ) 



2 -w»i + %£ 



k + i ^ (^+1) 

where, letting d r = 1 if r is even and otherwise, 

Z fc = Z fc (r):= ]T W fc (i)W fc (r-i) + dr- f^^Y (13) 



l<i<r— 1 
i^r — i 
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Therefore 

E[Aiy fc I F k+1 ] = --^-W k+1 + ^ (14) 

[ 2 ) 

and also 

E[A^ fc | F k+1 ] = -^-^W k+l . (15) 

Now note that the absolute value of the difference between the jumps of W^+i and W k+ \ is at most 
3. Thus 

n\W k - W k \ | F k+1 ] = E[\W k+l - W k+1 + AW k - AW k \ \ F k+1 ] 

< E[W k+1 - W k+1 + AW k - AW k | F k+1 ] • l {Wk+l _w k+1 >3} 
+ E[W k+1 - W k+l + AW k - AW k | F k+1 ) • l {Wfc+1 _H? fc+1 <_ 3 } 
+ (\W k+1 - W k+1 \ + E[\AW k - AW k \ | F k+1 }) ■ l { |vK fc+1 -H7 fc+1 |<2}- 
Using (TJ3D and (fTo"j) we obtain 

n\W k - W k \ | F k+1 ) < (w k+1 - W k+1 - (w k+1 - W k+l ) + ■ V fc+1 _^ fc+1 > 3} 

+ (^i - w ™ - itt " Wk+; ) ~ fKy) • W-«w<-s> 

+ (\W k+1 - W k+1 \ + 3 • F(&W k ? AW k | F k+1 )) ■ l {lWk+1 _w k+1 \<2y 

By (HU we have that 

n\w k -w k \ i F k+1 \ 



+ ( |^ +1 -^ +1 | + -|^ fc+1 -w fc+1 |+ c v _ /. . i {m+i _ Wk+im 



<Ww-%i 1-rrr +t + « " r, L + 



k + lj k k 2 ( fc + x ) ' 

Taking expectation and using Lemma [T] we obtain that 



E(|W* - Wfc|) < ( 1 - 7^ ) E[\W k+1 - W k+1 \] + c ( — + 



Dividing the previous inequality by k(k — 1) we obtain a recurrence formula that we iterate from 
k up to a n — 1: 
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1 



k(k- 1) 



n\w k -w k \) < 



i 



k(k + i) 



E(\W k+1 -W k+1 \) + c 



fc 2 1 

n 2 k 



< 



a n (a 



i a "~ 1 /i i \ 



< 



1 



a„(a n - 1) 
Finally by Lemma [1] 
1 



E (|W an -E(W an )|)+ E(Ty 0n )-E(W 0n ) +E(|W an -E(W an )|))+c(^+ 1 



2 /c 2 / ' 



fc(fc-l) 



E(|W fc - W fc |) < c 



a n (a n — \)\y/n n 2 n 



a n + _}_ 

n 2 k 2 



(16) 



< c 



1 



+ -» + 



1 



a n y/n n 2 k 2 ) 

This gives the second claim of the Lemma. Using now (|16|) in (|12|) yields the first claim. 
Lemma 3. There is a constant c < oo such that for r < s it holds that 



□ 



V (W k (r)-W k (r)) <c 



+ 



+ + 1 

a n n 



for all 1 < k < a n . 



Proof. We again write here shortly W k , W k , AW k and AW k instead of W k (r), W k (r), AW k (r) and 
AW k {r) respectively. 

Using ([T3|) and ([15]) together with the fact that \AW k _i - AW k -i\ < 3 we obtain 



V(W fc _i - W k -i) = V ( W k - W k + AWVi - A^ fe _x 

< V ( w k - w k 

+ 2E( E ( W fc - W k - E( W fe - W k ) ) ( AW k -i - Alf w - E( AW k ^ - AW k -i 



T k 



+ E AWfc-i- AW fc _i 



1 - ^ ) V ( Wfc - W fc ) + 2E ( ( W k - W k - E( W k - W k 



+ 9P(W fc _i ^ AWV 



^fc ~ E(Z fc ) 



Applying now the Cauchy-Schwarz inequality for the second term and Lemma [2] for the third term 
on the right-hand side of the inequality above we obtain that for a constant c < oo 



V 



(V fc _i - w k . 



i < l-r V [W k -W k ) + 



k(k 



(Y(w k -W k )) 1/2 •(Y(Z k )) 1 l> 



+ c 



k Qiri k 1 



a„ \fn n 2 k 



(17) 
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Let us now look closer at the variance of Z\.. In order to bound it from above, it is sufficient to 
bound the terms of the form ^(W k (i)W k (j)), 1 < i,j < r (see the definition of Z k in (PT3j) ). Writing 
shortly W' k and W' k for W k (i) and W k (j) respectively we have that 

nW k (i)W k (j)) < Efawg - E(W£)E(W£) 

= e((^ - nw' k ))w k ' + E(wfi<wz - nwn)) y 

< 2E((W£ - E(I^)) 2 «) 2 + {E{W' k )f{W' k ' - nW k ')f 
Using the fact that Wj! < k and then Lemma Q] we obtain 

V(W fc (i)W fc (j)) (18) 

< 2E((W^ - E(^)) 2 « - E«)X) + 2E((W^ - E(^)) 2 E«)<) 

+ 2(E(^.)) 2 V(0 

< 2e((W£ - E{W' k )f(W' k ' - E{W' k r f) + 2E((W£ - E(^)) 2 « - E«))E(W£)) 

+ 2A; • E((W^ - E(^)) 2 E«)) + 2(E(^)) 2 V«) 

< 2E((Wi - E(W' k )f(W' k ' - E«) 2 ) + 4k ■ E((W k - E(^)) 2 E«)) + 2(E(^)) 2 V«) 



< 2E( (W' k - E(W' k ))\W' k ' - E«) 2 ) + c- 2 



for a constant c < oo. Moreover 
E([W' k -E{W' k )f 

= E((W' k f ) - AE{{W' k f)E{W' k ) + 6E((^) 2 )(E(^)) 2 - 4E{W' k )(E(W' k )f + {E{W' k ))\ 
Inserting now the formulas from Lemma [T] the leading terms cancel and we obtain that 



e((^-e ( ^)) 4 )=o(^ + ^). 



(19) 



Using the Cauchy-Schwarz inequality and (|19h in (|18h we get that 

/ k e k 5 k 2 
nW k (i)W k (j))<c(- + - + - 
\n n z n 

and therefore 

/ 1.5 p 

Y(Z k )<c(^ + - 
for some constant c < oo. Plugging this into (|17j) we obtain that 

V (Wn - W^) < (l - j\ v (w k - W k ) (20) 



n ky/n I \a n y/n n 2 k 
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Observe that 



r{y{w, - n,)) 1/2 -f — + ^= i < < 



2c 2 ( ^ A 



+ ^ 1 else 



and therefore (1201) becomes 



We now divide both sides by ( 3 ) and iterate up to a n . Since V ( W an — W an J < c^, we obtain 



(V) V ' n 2 j 



1 ( 3 a n j . 1\ , 1 ^ 



and therefore 



V Wfc_i-W fc _i <c-k 



1 a„ 1 1 
I — H 1 

ka n y/n kn 2 k 3 a n n 



Li2 _ r,2 l3 

, « «nre re 
< c • I f= H 7T- H h 1 



a„\/n a n n 



This is the claim. 



□ 



4 Proof of the Theorem 

The proof that // r , the expected length of order r, is equal to ~ for every r > 1 can be found 
in [Be09j, Theorem 2.11 or in |Du08j , Theorem 2.1. Another quick way to see this is by using 
Lemma [TJ 

E(£-) = E( ± W k (r)X k ) = ± E(W h (r))E(X k ) = ± ^^fe^f^ ' *(* - D ' 7^ 



2 



(n — 1) • • • (n — r) 



I^i(i + i)---(i + r-2). 



J'=l 



The claim follows now from the fact that Y^j=i 3 U + 1) • • • (j + *) = ^p2 n ( n + 1) • • • (n + i + 1). 

The asymptotic normality of the total external branch length of the Kingman coalescent (case 
s = 1) was proved in |JKllj . We will prove the theorem for s > 2. 

For 1 < r < s we divide L n,r and the corresponding coupled quantity into parts. For 1 < b n < 
o-n < n let 

E ijrnj-"'* - d E < 21 > 

b n <k<a„ b n <k<a„ 
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be the length of order r collected between the levels b n and a n in the coalescent tree and the 
corresponding quantity obtained from the coupling. Note that L™\ = L n,T with L n,r defined in ([6]) 
and let similarly 

(22) 



rn,r r n > r 

L ■- L n,V 



Using the coupling we will show that for e > 



n 



log n 



I/ 1 ' 1 — ElL 71 ' 1 ),..., L n,s — E(L n,b 



(23) 



L n > — ElL 



n.l 



, L n ' s — E( L ny 



> e) — >• 0, as n — > oo. 



Once (|23|) has been proved, the claim of the theorem follows since the components of the second 
vector above are from construction independent and identically distributed and they converge 
weakly to the standard normal distribution as n — > oo, as follows from the case s = 1 proved in 

[jktt] . 

The convergence in f|23|) is a direct consequence of the following result. 



Proposition 1. For L n ' r and L n,r defined in |6]) and \22\) respectively one has for all 1 < r < s 

and e > 



logn 



• (V' r - E(L n > r )) - (L n ' r - E(L n >' 



> e ) — > 0, as n — > oo. 



(24) 



Proof. We have by the Cauchy-Schwarz inequality that 



v E 



k(k-l) 



W k -W k - (E(W k ) - E(W k 



< 



b n <k<a n 

E E 

b n <k<a n b n <l<a n 

E E 

b„<k<a n b n <l<a n 

2 

m 

b n <k<a n 

Using Lemma [3] we obtain 



k(k- 1) 1(1 - 1) 



COV [W k -W k ,Wi-W l 



E 



k(k-l) 



k(k-l)l(l-l) 



■ v w k - w. 



V( Wjfe -Wjfe) Y(W l -W l 



1/2 



v E 



, b n <k<a n 



k(k-l) 



W k -W k - (E(W fc ) - E(VK fc ) 



(25) 



< c 



< c 



b n <k<a n 
1 



A: 2 



/a n n 



l l 

// ' b n ' \/n ' b, 



, ((l nVl l0g 6: + ^- l0g - + - + - 
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In order to show that the claim holds we consider three regions in the coalescent tree, namely 
between level n and level jj^^i, between level n^^p an d level n 1 / 2 and finally between level n 1 / 2 

and level 1, and write the lengths L™'^ and L™'^ as sums of the lengths gathered in these three 
regions. 

For the first region let 

a n = n and b n = 

(log n) z 

We obtain from (|25p and Chebyshev's inequality that 

L n '\„ ~ E fe r J = - E (lYA + P ( log log " 



a n ,b n \ a n ,b n j a n ,b n \ a n ,b n 

and therefore 

n 



L n / h ~ 1 ( L n f h ))-( L n / h - E ( L n / h )))^0 (26) 

a„,b„ y a„,b n J J y a„,b„ y a„,b n J J J y ' 



log n 

in probability as n — > oo. 

The second region we consider is the one between the levels a n and b n with 



(log n) 



and b n = n 1 / 2 . 



We put together the coupling for the two regions by taking the starting distribution for the second 
region to be the distribution of the chain at the end of the first region. Again from (|25p we get that 



L n '\ - E L£V = LY. ~ E LY. ) + P 



and therefore 



" (tL n >\ -E(L n ' r h ))-[L n '\ -E(L n ' r , )))->0 (27) 

a n ,b n \ a n ,b„ II \ a„,b„ \ a n ,b n III ' 



log n 

in probability as n — > oo. 

For the region in the coalescent between the levels n 1 / 2 and 1 we claim that 



and 



as n — > oo. The second claim follows directly from Proposition 3 in [JKllj . whereas for (|28p we get 
similarly using Lemma [T] that 

E(L n ^ r b )=E\ y Wk -—^—)<c y ¥-.—±—< c .^, (30) 
V a n ,b n J I k k(k-l) - ^ n k(k-l) ~ n' v 1 

\b n <k<a n I b n <k<a n 
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for some constant c < oo. Therefore, setting a n = n 1 / 2 and b n = 1 in (|30|) . we obtain our claim 



28]). Since both yj^i ■ ^ n i/2 x an< ^ y Egn ' i are P°siti ve random variables, it follows from 
28|) and ([29]) respectively, that 



" ^X^O and J-^-.L^^O (31) 



logn « 1/2 . 1 ' Vl°g n ™ 1/2 ' 1 

in probability as n — > oo 



Writing 



^ — ^n,l — — — — ry + ^ n l/a + ^„l/2 1 



rn,r rri,r T n,r . T n,r . T n,r 

L ~ L n,i ~ K,— 2_ + ^ a 1/2 + 



and 



and using (j26|) - (|29l) and (I3ip we get the claim of the proposition and therefore our theorem is 
proved. □ 
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